Santa Cruz Predictive Data Grouping Using Successor
Authors
Abstract
Latency is an ever-increasing component of data access costs, which in turn are often the bottleneck for modern high-performance systems. The ability to predict future data accesses is essential to any attempt at addressing this problem, and we present a novel model for gathering and utilizing data access predictions. Prior attempts to utilize access predictions have taken the form of a single predictive engine attempting to preemptively fetch data. We offer a more powerful model that separates the process of access prediction from the data retrieval mechanism. Predictions are made on a per-file basis and used to provide a minimal amount of additional metadata, which in turn is used by a grouping mechanism to automatically associate related items. This approach allows truly opportunistic utilization of predictive information, with few of the timing restrictions of prior approaches. Our research covers access prediction, grouping based on predictions, and a discussion of predictability and its meaning in the context of I/O behavior. We present two predictors: Noah, named for its prediction of pairs, and Recent Popularity, a majority voting mechanism. We distinguish the goal of predicting the most events accurately (general accuracy) from the goal of offering the most accurate predictions (specific accuracy). Both predictors can trade the number of events predicted for accuracy. Trace-based evaluation demonstrates that their error rates can be adjusted to less than 2% for more than 60% of all access requests. The per-file metadata derived from these predictions is then used separately by our grouping mechanism. To demonstrate the usefulness of grouping, we present the aggregating cache, which manages distributed file system caches based upon groups built from our successor predictions. We present trace-driven results demonstrating that grouping can reduce LRU demand fetches by 50% to 60%. If we consider the effects of intervening caches, we observe dramatic gains for our predictive cache. Our treatment includes information-theoretic results that justify our approach, a graphical explanation of the effects of caches on workload predictability (cache-frequency plots), as well as relative predictor performance (rank-difference plots).
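The separation of prediction from retrieval described in the abstract can be illustrated with a small sketch. The abstract does not give the details of Recent Popularity or of the aggregating cache, so everything concrete here is an assumption made for illustration: the class names, the window size `k`, the vote threshold `j`, the `group_size` parameter, and the choice to build a group by following a chain of predicted successors. It is not the authors' implementation.

```python
from collections import Counter, OrderedDict, defaultdict, deque


class RecentPopularityPredictor:
    """Per-file successor predictor using majority voting (illustrative only)."""

    def __init__(self, k=4, j=3):
        self.k = k  # assumed: number of recent successors remembered per file
        self.j = j  # assumed: votes required before a prediction is offered
        self.history = defaultdict(lambda: deque(maxlen=k))
        self.previous = None

    def record(self, file_id):
        """Record that file_id was accessed immediately after the previous file."""
        if self.previous is not None:
            self.history[self.previous].append(file_id)
        self.previous = file_id

    def predict(self, file_id):
        """Return the likely successor of file_id, or None to decline."""
        recent = self.history.get(file_id)
        if not recent:
            return None
        candidate, votes = Counter(recent).most_common(1)[0]
        # Raising j offers fewer predictions but makes each one more reliable.
        return candidate if votes >= self.j else None


class AggregatingCache:
    """LRU cache that fetches a predicted group on each miss (illustrative only)."""

    def __init__(self, capacity, predictor, group_size=2):
        self.capacity = capacity
        self.predictor = predictor
        self.group_size = group_size  # assumed: maximum files fetched per miss
        self.entries = OrderedDict()  # least recently used entry comes first
        self.demand_fetches = 0

    def _insert(self, file_id):
        self.entries[file_id] = True
        self.entries.move_to_end(file_id)
        while len(self.entries) > self.capacity:
            self.entries.popitem(last=False)

    def access(self, file_id):
        """Access a file; return True on a hit, False on a demand fetch."""
        self.predictor.record(file_id)
        if file_id in self.entries:
            self.entries.move_to_end(file_id)
            return True
        self.demand_fetches += 1
        # Build the group by following predicted successors from the missed file.
        group = [file_id]
        while len(group) < self.group_size:
            successor = self.predictor.predict(group[-1])
            if successor is None or successor in group:
                break
            group.append(successor)
        # Insert predicted members first so the demanded file is most recent.
        for member in reversed(group):
            self._insert(member)
        return False


def count_demand_fetches(trace, capacity, group_size):
    """Replay a trace and return the number of demand fetches."""
    cache = AggregatingCache(capacity, RecentPopularityPredictor(k=2, j=2), group_size)
    for file_id in trace:
        cache.access(file_id)
    return cache.demand_fetches
```

Setting `group_size=1` reduces the sketch to plain LRU, which gives a baseline for comparing demand fetches on the same trace. The predictor only maintains per-file successor metadata; the cache decides when and whether to use it, which mirrors the decoupling the abstract describes.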
Similar resources
Sanitary Wastewater Supplemented with Glycerol to Obtain Lipid-Rich Microalgal Biomass
Introduction: Mixotrophic microalgae systems have great potential for bioenergy production and wastewater treatment. Anaerobic-treated wastewater supplemented with carbon can improve biomass yield and quality, as it presents low carbon content. Alternative carbon sources in microalgae cultivation, such as glycerol, are essential for minimizing the economic and environmental impacts caused by bi...
Morphological and Molecular Characteristics of Sarcocystis aucheniae Isolated from Meat of Guanaco (Lama guanicoe)
Background: Sarcocystosis in South American camelids (SAC) is an important parasitic disease which results in economical loss due to carcass condemnation. Meat products from camelids are significant source of animal protein in several American countries. Sarcocystis spp. producing macroscopical cysts in these animals have been nominated as S. aucheniae, S. tilopodi, and S. guanicoecanis. The ai...
Strengthening with Blood Flow Restriction: Can it be a Useful Option in the Rehabilitation of Patients with Coronavirus?
It is necessary to have a greater understanding of COVID-19, and studies with an adequate design must be performed to be able to make treatment and rehabilitation recommendations with a sufficient degree of evidence. However, for patients discharged from the hospital who have been cured of the infection and who have significant functional impairment, Blood flow restriction (BFR) strengthening m...
The Power of Classic Music to Reduce Anxiety in Rats Treated with Simvastatin
Introduction: This study was designed to investigate the effects of music in Wistar rats after sub-chronic treatment of simvastatin. The rats were orally administered with either simvastatin or saline (controls). After 4 weeks of drug treatment, the rats were selected for behavioral studies. The rats were exposed to music 24 hours before behavioral tests (Mozart’s piano sonata, KV361, Largo). R...
DNA damage in dental pulp mesenchymal stem cells: An in vitro study
The aim of this study was to evaluate the potential use of a DNA comet assay, DNA fragmentation fluorimetric assay and reactive oxygen species levels as potential biomarkers of genome conditions of dental pulp stem cells (DPSCs) isolated from dog canine teeth. Mesenchymal stem cells were isolated from the dental pulp collected from dog teeth. The results obtained suggest the ideal moment for cl...